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Given two independent samples of non-negative random variables with unknown distribution 
functions F and G, respectively, we introduce and discuss two tests for the hypothesis that F is 
less than or equal to G in increasing convex order. The test statistics are based on the empirical 
stop-loss transform, critical values are obtained by a bootstrap procedure. It turns out that for 
the resampling a size switching is necessary. We show that the resulting tests are consistent 
against all alternatives and that they are asymptotically of the given size a. A specific feature of 
the problem is the behavior of the tests 'inside' the hypothesis, where F ^G. We also investigate 
and compare this aspect for the two tests. 

Keywords: bootstrap critical values; empirical stop-loss transform; increasing convex order; 
one-sided two-sample tests. 

1. Introduction 

One of the basic problems of actuarial mathematics and mathematical finance is the 
comparison of risks; see, for example, Kaas et al. (1994), Miiller and Stoyan (2002) and 
Rolski et al. (1999). In order to introduce the aspect that we will deal with suppose 
that the random variables X and Y represent the loss associated with two insurance 
policies or other financial contracts; we assume throughout the paper that the random 
variables are non-negative and that they have finite mean. Clearly, ii X <Y then the 
comparison is a trivial task. However, the random quantities X and Y will in general 
not be directly comparable, and indeed, the comparison of risk is normally based on the 
respective distribution functions F and G of X and Y . 

A classical partial order for distributions is the stochastic order: We say that F is 
less than or equal to G in stochastic order and write F <st G (or, with some abuse of 
notation, X <st Y) if 

l-F{x)<l-G{x) foraUa;>0. (1.1) 
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Many interesting and useful results are known for this concept, especially for its statis- 
tical aspects such as testing the hypothesis that F <st G; see, for example. Chapter 6 
in Conover (1971) or Kapitel 3,4 in Witting and Nolle (1970). However, in the context of 
risk comparison, stochastic order is too restrictive and it does not capture the important 
notion that risk should also depend on variability. For this and other reasons alternative 
notions of partial order for distributions have been investigated extensively, and the in- 
creasing convex order has turned out to play a major role in this application area. There 
are several equivalent definitions: We have F <icx G if 

Ef{X) < Ef{Y) for all / £ T-,,^, (1.2) 

where J^icx denotes the set of increasing convex functions / : M, or if 

E{X - t)+ < E{Y - t)+ foralH>0, (1.3) 

see, for example. Theorem 1.5.7 in Miiller and Stoyan (2002). Whereas (1.2) explains the 
terminology, (1.3) has the obvious interpretation in terms of reinsurance and stop-loss 
contracts. In fact, a straightforward application of Fubini's theorem leads to yet another 
condition. For this, we introduce the stop-loss transform SL(_F) or F^^ associated with 
a distribution function F, 

/oo 
{1-F{x))dx foralH>0 (1.4) 

{F^^ is the integrated survival function; we prefer the notation SL(i^) whenever we 
want to emphasize the operator interpretation of the transform). Then the following is 
equivalent to (1.2) and (1.3): 

F^^{t)<G^^(t) foralH>0. (1.5) 

Indeed, the increasing convex order is often called stop-loss order, especially in an actu- 
arial context. Obviously X <icx Y ii X <st Y and EY < oo; hence, for random variables 
with finite mean, increasing convex order is less restrictive than stochastic order. Further, 
by Jensen's inequality, we always have X <icx Y ii X = EY. This captures the fact that 
in an actuarial application, a fixed loss of magnitude c would always be preferable to 
a random loss with mean c. In stochastic order distributions with the same mean are 
comparable only if they are identical. 

When comparing actual risks an assertion such as X <icx Y may result from general 
considerations or may be the consequence of some model assumption. Here we take the 
view that data from previous contracts could serve as the basis for such a statement and 
we introduce two statistical tests that are applicable in this context. Such tests also have 
an obvious role in connection with model checking. 

Formally, we assume that we have two independent samples Xi, . . . , X^ and Yi, . . . , l^j, 
where the X-variables are independent and have distribution function F and the Y- 
variables are independent and have distribution function G, and we consider the one-sided 
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composite hypothesis F <icx G against the general alternative F ^icx G. In the special 
case where the distributions F and G are discrete and are concentrated on a fixed finite 
set the likelihood ratio test can be applied; this has been investigated in Liu and Wang 
(2003). Our approach in the general case is based on the plug- in principle. Let 1a denote 
the indicator function associated with the set A. Regarding the empirical distribution 
functions 

^ m \ ^ 

^"' = — and G„ = -^l[Y,,oo), 

3=1 k=l 

of the two samples Xi,. . . , Xm and Fi, . . . , 1^ as suitable estimators for F and G, we are 
led to estimate F^^ and G^^ by the respective empirical stop-loss transforms F^^ and 
G^. (We note in passing that, as SL is one-to-one, F^ is the nonparametric likelihood 
estimator for F^^ in the same sense as Fn has this property as an estimator for F; see, 
for example, Shorack and Wellner (1986), page 332f.) In view of the close relation of the 
present problem to the analogue for the classical stochastic order (see also Sections 3.1 
and 3.5 below) natural candidates for test statistics arc then the one-sided Kolmogorov- 
Smirnov test statistic for increasing convex order, 

t>0 

and the one-sided Cramer-von Mises test statistic for increasing convex order, 

poo 

T^:,r=.^™,„/ {F^t{t)-Gi^{t))+dt, (1.7) 

Jo 

where we have used the abbreviation 

/ mn 

Km,n '■= \ ; — (l-°) 

\j m. + n 

for the multiplicative constants that are needed in order to obtain non-trivial limit dis- 
tributions; see Section 2.2 below. In contrast to the classical stochastic order situation 
these statistics are not distribution-free, so a practical way to obtain critical values is 
needed. We propose to use the bootstrap; as we will see, this leads to a problem-specific 
variant that is a consequence of the nature of the hypothesis and that docs not seem to 
arise in other situations where bootstrap tests have been suggested. 

We have used the comparison of risks as a general motivation, but even within an actu- 
arial context there arc many different and specific applications of increasing convex order. 
An excellent source is the recent book by Denuit et al. (2005), where this order is used 
in connection with premium calculation, option pricing, the modeling of dependencies 
and the comparison of claim frequencies. Another important application area is renewal 
theory, where the stop loss transform of a lifetime distribution arises in connection with 
the stationary delay distribution and the limit distribution of the forward and backward 
recurrence times. Renewal theory in turn is a basic building block throughout the whole 
area of stochastic modeling. 
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In the next section we state our main results. Section 3 contains two examples and 
the results of a small simulation study; we also look at the practical calculation of the 
test statistics and we consider the relation to stochastic order in more detail. Proofs are 
collected in Section 4. 

2. Main results 

We first introduce some notation and list our general assumptions. Then we investigate 
the asymptotic behavior of the test statistics, both for situations where the hypothesis 
is true and situations where it is not. Our variant of obtaining bootstrap critical values 
is the topic of the next subsection. Finally, wc combine these results into a description 
of the asymptotic behavior of the overall procedure, that is, where the test statistics are 
used together with the bootstrap critical values. 

2.1. Preliminaries 

In our main results a particular class of stochastic processes will be important: For a 
distribution function H on R_|_, let Bh — (Sff(i))t>o be a centered Gaussian process 
with covariance function pn, 

{H{uAv)~H{u)H{v))dudv (2.1) 

for all s,t > 0. Such a process can be obtained from a standard Brownian bridge B = 
(i?(t))o<t<i, which is a centered Gaussian process with covariance function 

cov{B{s), B(t)) = s At - s ■ t, 

via 

/oo 
B{H{s))ds foraUt>0. (2.2) 

In particular, a process Bh with the above properties exists if / x^H{dx) < oo (the 
role of moment conditions will be considered in more detail in the proofs section). The 
representation (2.2) could also informally be written as Bh ~ SL(i? o H). In view of the 
facts that, first, y/rn{F,n — F) converges in distribution to B o F and, second, that SL is 
a linear operator it is not surprising that processes of this type appear in the context of 
distributional asymptotics for empirical stop-loss transforms. 

The covariance structure of B o H matches that of (l[2,oo) it))t>Oi where Z is a random 
variable with distribution function H. This (or a simple direct calculation) yields the 
alternative representation 



PH(s,t) = cov((Z-s)+,(Z-i)+) for all s,t>0 



(2.3) 
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for the covariance function of Bh- In the proofs both (2.1) and (2.3) will be useful. We 
will also occasionally find it useful to extend these processes to the compactified half- 
line by Bh (oo) :~ 0; a similar convention is used for the stop-loss transforms. Moment 
assumptions will imply that Bh exists (as already mentioned above); that the paths 
are continuous (and hence bounded) functions on the compact set K := [6,00], which 
is important in the Kolmogorov-Smirnov situation; or that these paths are integrable 
functions on K , which is necessary in the Cramer-von Mises case. 

Let F and G be distribution functions on with finite mean and let (Xj)j^n and 
(yfe)*;eN be two independent sequences of independent random variables, where the Xj's 
have distribution function F and the Y^'s have distribution function G. Our convergence 
results refer to statistics Tm,n that depend on the first m of the X-variables and the first 
n of the F-values, where we generally assume that 

in 

minjm, n| — > 00, >Te[0, ll. (2.4) 

^ m + n ^ ' ^ ^ 

We write T,„ „ — >distr T if Tm,n converges in distribution to T. 

In order to describe the behavior of the tests inside the hypothesis we need two more 
definitions. Given F and G, 

A{F,G) ■.= {te[0,oo]:F^^it)^G^^it)} (2.5) 

denotes the set where the two stop-loss transforms are equal. Our general conventions 
imply that A{F, G) ^ 0. Let H := {1 - t)F + tG, with t as in (2.4). Then 

j{H):^snp{te[0,oo):H{t)<l} (2.6) 

is the right end-point of the support of H. Finally, we generally assume that F and G 
are not degenerate and that the significance level a is less than 1/2. 



2.2. Asymptotic behavior of the test statistics 

The first theorem in this subsection gives the limit distributions of T,^^j and in 
the case where the hypothesis is true. 

Theorem 1. Suppose that F <icx G and that G has a finite second moment. Then, with 
H = (1 — t)F -\- tG and Bh as in Section 2.1, 



teA(_F,G) 



r,„fn,Tt 7^^':= sup Bnit). 

distr 



// j x'^^^G{dx) < 00 for some e > 0, then 



T^:^,r^T^^^:= / B+it)dt. 



distr 



A{F,G) 



104 



L. Baringhaus and R. Griibel 



On the alternative we have the following behavior. 

Theorem 2. Suppose that F ^icx G. Then T^s„ and T^^^ converge to oo with proba- 
bility 1. 

2.3. Bootstrap critical values 

The step from a test statistic T to an actual statistical test cj) = l^j'>c} requires a critical 
value c = c(a) that depends on the chosen error bound a for the probability of wrongly 
rejecting the hypothesis. For stochastic order, due to the strong invariance properties 
and their consequences, the classical tests are level a tests even for finite sample sizes. 
In the present situation, however, we have to content ourselves with asymptotics, aiming 
for a procedure that satisfies the error bound asymptotically as the sample size(s) tend 
to infinity. At first, one might think of using Theorem 1 in the special case F ~G with 
the quantiles of the respective limit distribution as critical values. However, these limit 
distributions depend on the unknown F; also, it is not clear what the consequences are 
if we are inside the hypothesis in the sense that F <icx G, but F ^G. 

In this context resampling methodology provides a practicable solution. Recall that Fm 
and Gn are the empirical distribution functions associated with the samples Xi , . . . , Xm 
from F and Yi, . . . ,1^ from G. These random variables are the initial segments of two 
infinite sequences {Xj)j^^ and (yfe)fceN of random variables defined on some background 
probability space (fi,^, P); the almost sure statements below refer to P. Given the 
initial segments, let Zm,1: ■ ■ Zm,n be a sample of size TV :— m + n from the (random) 
distribution function 

F^rn^n \ F^a ; Gji- (2.7) 

in + n m + n 

Note the 'size switching': In contrast to many two-sample bootstrap tests in the literature 
the basis for the resampling that we use here is a distribution that assigns higher prob- 
abilities to the values from the smaller sample (this aspect will be further discussed in 
Section 3.2). The following theorem shows that, with probability 1, the limit distribution 
for the respective test statistics is the same as in Theorem 1 in the special case F = G 
(so that A{F,G) = [0,oo]). 

Theorem 3. With Zj^^i, . . . , Zn^n o,s above, let F^^m o-nd GN,n be the empirical dis- 
tribution functions associated with Z^ri, . . . , Zjv,m o,nd Zf4_„i+i, •■•,^Af,Af, respectively. 
Let 

Tl% ■■= ^n.,n sup(F^L^(t) ^ Gj^,^{t)) (2.8) 

t>0 

and 

poo 

T^:n--=^m,n {Ff^,m{t)~Gf^n{t))^dt (2.9) 

^0 
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be the bootstrap versions of the Kolmogorov-Smirnov and the Cramer-von Mises test 
statistics. Let Bh be as in Section 2.1, with H = {1 — t)F + tG. Then, with probability 
one, 

snpBHit). (2.10) 

distr t>o 

// J x^''^^ F{dx) < oo and J x^'^^G{dx) < oo for some £ > 0, then, again with probability 
one, 

/>oo 

tn:^r^^ / B+{t)dt. (2.11) 
distr Jq 

The distributions of T,^^ and T^^^ are functions of the data Xi, . . . , Xm, li, . . . , y„. 
The famihar Monte Carlo procedure can be used to obtain approximations for the re- 
spective distribution functions and quantiles. 



2.4. The bootstrap tests 

Let T^^ and T'~^^^ be as in Theorem 1, and suppose now that F = G. In particular, 
the supremum and the integral are taken over the whole half- line. Let c^^{F,a) and 
c'~^^^{F,a) be the associated upper a-quantiles, that is, 

^KS|c.A/(^^^) := inf{(?> 0:P(rKS|C"^^ > q) < a}, (2.12) 

and let cj^^„(a) and c^^^(a) be the corresponding bootstrap estimates, that is, the upper 
a-quantiles for and T^Jf , respectively. We now consider the procedure that arises 
from using the test statistics defined in Section 1 together with these critical values. The 
following theorem is the main result of the paper. Our proof will give a slightly stronger 
result; we concentrate here on the statistically most relevant aspects. Let £ denote the 
Lebesgue measure. 



Theorem 4. Let F and G be distribution functions with finite second moment, put 
S := A{F, G) n [0, 7(iJ)). Then the limit 



exists, and 



iPKsia; F,G):= lim PiT^% > c'^^Ja)) 

ni.n — 'OO 



iPKs{c^;F,G)=a ifF^G, 

iPKs{a;F,G)<a ifF<,,^G, 

■0ks(q^; F,G) — if and only if F <icx G and 5 = 0, 

iPKsia;F,G) = l ifF^,,^G. 
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If J x'^+''F{dx) < oo an: 




exists, and 



V'CvM(a;^, G) = a 
ipcvuia; F,G) < a 
i>CvM{a;F,G) = 
ipCvMia; F, G) = 1 



if and only if F <icx G and £{S) = 0, 
ifF^ic^G. 



ifF = G, 



In words: The tests are asymptotically exact if the two distributions are the same; 
they are asymptotically of the preassigned level; they are consistent inside the hypothesis 
(where the meaning of 'inside' depends on the type of the test), and they are consistent 
against all alternatives. Whereas the Cramcr-von Mises test requires a stronger moment 
assumption than the Kolmogorov-Smirnov test, it is asymptotically of level zero on a 
larger subset of the hypothesis. In particular, this 'consistency inside the hypothesis' can 
be used to distinguish between the two tests. 

3. Examples, simulations and remarks 

We first discuss the use of rank tests in Section 3.1. An example in Section 3.2 shows 
that the size switching in the resampling part mentioned after (2.7) is important; this 
example also illustrates the different behavior of the two tests inside the hypothesis. In 
Section 3.3 we provide two formulas that can be used to calculate the test statistics. 
Section 3.4 contains the result of a small simulation study. In the last subsection we 
show that, from an abstract statistical point of view, increasing convex order differs in 
an important way from the classical stochastic order. 

3.1. Unsuitability of rank tests 

What happens if we use the classical procedures designed for stochastic order, such as the 
Wilcoxon test and the Kolmogorov-Smirnov test, in connection with increasing convex 
order? 

We write Wei(/3) for the WcibuU distribution with parameter f3 > and Exp(A) for 
the exponential distribution with parameter A > 0; the respective distribution fimctions 
are F{x) = 1 — exp(— a:^) and G{x) = 1 — exp(— Ace), a; > 0. The stop-loss transforms (in- 
tegrated survival functions) of the Wci(2) and the Exp(l) distribution are 



i^^L(i):=: V7t(l-$(V2^)) and G^^(0 = exp(-t), t>0, 
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where $ denotes the distribution function for the standard normal distribution. Figure 1 
shows the two distribution functions on the left and the two stop-loss transforms on the 
right. Obviously, these distributions are not comparable in the usual stochastic order. 
However, we do have F <icx G in view of the Karlin-Novikov criterion (the mean of F 
is less than or equal to the mean of G and, for some to, F{t) < G{t) for all t <tQ and 
F{t) > G{t) for aU t > to] see, e.g.. Theorem 3.2.4 in Rolski et al. (1999)). 

Suppose we use the one-sided Wilcoxon test for testing <icx G against the one-sided 
alternative G <icx F , and the one-sided Kolmogorov-Smirnov test (in its version designed 
for stochastic order) for testing F <icx G against the general alternative F ^icx G. The 
first test rejects the hypothesis if Wm,n > Cm,„, where Wm,„ = YJk=i l{x,>Yfc} is 

the Wilcoxon-Mann-Whitney test statistic and Wm,n is the (1 — a)-quantile of Wm^n in 
the case where F = G. From Wm,nl [mn) ^1/2 and 

^ m n 

i=l fc=l 

in probabihty with tt^^g P{Xy > Yi) = /(I - F{x))G{i\x) it follows that P(W,„,„ > 
Wm,ix) ^1 as m, n ^ OO if t^f,g > \ - Since 

/>oo 

7rw(2).Exp(i) = / exp(-a; - x^) dx > 0.54 • • . > 5, 
Jo 

we arrive at the conclusion that in the case F — Wei(2) and G = Exp(l) (where the 
hypothesis is true) the probability of an error of the first kind of the one-sided Wilcoxon 
test tends to 1 as m,n^ 00. 




Figure 1. Distribution functions (left) and stop-loss transforms (right) for Exp(l) (solid line) 
and Wei(2) (dotted line). 
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The one-sided Kolmogorov-Smirnov test for stochastic order rejects the hypothesis if 



where fcm.n is the (1 — a)-quantilc of sup^>Q(G'„(a;) — Fn{x)) in the case where F ~ G. 
We always have km,n — > and, with F = Wei(2) and G — Exp(l) again, 



in probabihty as m,n-^ oo. Thus, for this test, too, the probabihty of an error of the 
first kind can become arbitrarily close to 1. 

The two tests considered above are rank tests in the sense that the respective test 
statistics depend on the data only via the ranks of the sample variables in the pooled 
sample. We have seen that such tests may violate the bound a for the probability of 
wrongly rejecting the hypothesis F <icx G if used in conjunction with the critical values 
designed for stochastic order. Moreover, rank tests (t>m,n for increasing convex order 
that do respect the significance level a < 1 (asymptotically) may fail to detect that the 
hypothesis is wrong, in the sense that they cannot be consistent against all alternatives. 
This follows from the fact that rank tests are invariant with respect to continuous, strictly 
increasing transformations of the data, that is. 



whenever F, G are the distribution functions of respectively, and X has 

distribution function F, Y has distribution function G. Indeed, if F, G and 5" are such 
that F <icx G, F ^icx G, we have the (asymptotic) upper bound a on the left-hand side 
of (3.1) because of the bound on the probability of an error of the first kind, whereas 
consistency would require the limit 1 on the right-hand side as m,n— > cx). An example 
for this situation can be obtained by choosing F and G to be the distribution func- 
tions for the uniform distribution on the intervals (0,1) and (7/16,9/16), respectively, 
together with '^{x) ~ ^/x. Then F <icx G follows with the Karlin-Novikov criterion, and 
a straightforward calculation shows that 



sup(G„(x) - F„(x)) > fc, 



sup(G„(a;) - F„(a;)) ^ sup(G(a;) - F{x)) > 




EF,G<t'm,n — Ep^^(j)„i,n, 



(3.1) 




which implies that -F ^icx G. 



3.2. Size switching does matter 



Suppose that the distributions of the X- and y-variablcs are given by 



P(X,=0) = P(X, = l) = i and PiYk^0)^l-P{Yk^2)^l, 
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respectively. The corresponding stop-loss transforms are 

^SL(,)^r(l-OA 0<t<l, G«L(,).|(2-t)/4, 0<t<2, 

Obviously, F <icx G and j4(i^, G) ~ {0} U [2,cx)]. In Section 2.3 we based the resampling 
on the distribution function 

--Tm H (j-n- 



m + n ?n + 71 

We now investigate, for the above F and G, what the consequences are if we resample 
from 

m + n m + m 

instead. This corresponds to the standard resampling scheme where each of the items 
from the pooled sample is chosen with the same probability, so that the subsamples 
are represented proportional to their size when using -ff^n- This procedure has been 
suggested in Chapter 3.7.2 of van der Vaart and Wellner (1996) in connection with the 
general hypothesis F = G, for example, but of course, on the boundary F = G of the 
hypothesis F <icx G the size switching is asymptotically irrelevant. 

Let H = {1 — t)F + tG and iJ" := tF + (1 — t)G be the respective limits of these 
distribution functions as m,n—f oo, where we again assume that (2.4) holds. Let Bh be 
as in Section 2.1 and let similarly Bfjo = {BHo{t))t>o be a centered Gaussian process 
with covariance function 



oo />oo 



PHo{s,t) = / / H°{uAv)dudv-SL{H°)is)SL{H"){t). 



By Theorem 1 the test statistics converge in distribution to 

TKS= sup BH{t) = B+{0), 

where we have used the fact that Bh vanishes on [2,oo]. By Theorem 3 the bootstrap 
estimators using size-switched resampling converge in distribution to 

T™'*'='^ = supBi/(t) 

with probability 1. The same arguments also work for standard resampling, leading to 
the distributional limit 

TP"P = supi?^o(0, 

t>o 

again with probability 1. Moreover, using the same arguments as in the proofs of Theo- 
rems 3 and 4 it can be shown that, with the if°-based estimators c'^^{a) for the critical 
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values, 

lim P(rKS„ > cOiKS(^)) ^ p(rKS > ^KS(^))^ 
m,n — »oo 

where c[J^^(a) is the upper a-quantile of the distribution of TP''°p. 

In the present simple case, where the distributions arc concentrated on just two values, 
and using the fact that Bh and Bj^o can both be constructed from a standard Brown- 
ian bridge B, the limit processes essentially reduce to two-dimensional normal random 
variables. This enables us to work out some details. In fact, with the temporary notation 
X := B{H{0)) = B{{2 + t)/4) and Y B{H{1)) = B((4 - t)/4) wc have that 

.00 ( {l-t)X + Y, 0<t<l, 

Bnit)^ B{H{s))ds= l{2-t)Y, l<t<2, 

[0, t>2, 

which gives T™"'^'' = X+ + Y+. As X and Y are jointly normal with aj var(X) = 
(4 - t2)/16, cr| := var(y) = t(4 - r)/16 and cov(X,r) = t(2 + t)/16 we obtain, again 
with $ the standard normal distribution function, and with p := cov{X,Y)/{ai<72), 

for all z > 0. Similarly, replacing r by 1 — r, we obtain TP'°P ^ X+ + Y+ , now with 
Xo := B{H^{0)) = B((3 - t)/4) and Fq ^(^^"(l)) = B((3 + t)/4). 

Table 1 gives the numerical results for some a- values in the case r = 3/4 (the entries 
in columns 2-5 are rounded to four decimal points). The second and third columns 
contain the limiting critical values based on size-switched and proportional resampling, 
respectively, that is, the upper a-quantiles c^^{a) of the distribution of T^""^^^^ and the 
upper a-quantiles c^^{a) of the distribution of TP''°p. In columns 4 and 5 we have the 
corresponding asymptotic probabilities of rejection. The error bound is clearly violated 
for each of the a-values in the proportional case. Due to the fact that T^^ = -Sj(O) = 
{X + y)+ and T'^™*'''' ~ X+ + y+ it is not surprising that in the case of size-switched 
resampling the limiting power is close to the given level for small a: Conditionally on 
X+ + Y+ being large, we have X,Y >Q and hence {X + Y)+ = X+ + Y+ with high 
probability. 

Turning to the behavior of the Cramer-von Mises test for increasing convex order we 
first note that T^vM J^^^ B^{t) dt ~ 0. The limiting critical values are the upper a- 
quantiles of B^{t) dt for size-switched resampling and ^'^^ proportional 

resampling. Since these quantiles are positive for each a G (0, 1/2), the limiting power of 
the Cramcr-von Mises test is for both resampling schemes. With F and G as above 
we therefore also have an example for the different behavior of the two tests inside the 
hypothesis. 
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Table 1. Critical values and error probabilities 



a 






P(T^s >c^S(q)) 


P(rKS>c?^(a)) 


0.100 


1.0134 


0.8069 


0.0999 


0.1537 


0.050 


1.3004 


1.0202 


0.0500 


0.0984 


0.025 


1.5495 


1.2079 


0.0250 


0.0633 



3.3. Alternative expressions for the test statistics 

The definitions given in (1.6) and (1.7) result from applying the plug-in principle. From 
a practical point of view it is desirable to have alternative expressions that avoid the use 
of integrals. Suppose that the set 

{0, Xi, . . . , X,n, Yi, . . . , Yn} 

has I different values and denote these, in increasing order, by Zi, . . . , Zi. 

Proposition 1. Let fj := f{Zj) for j ~ 1, . . . , where 

^ ni 1 ^ 

/(*):=— max(Xj ,t) V max(Yfe, t), 

m ^ — ' n ^ — ' 

j=i fc=i 

and let J and K be the complementary set of numbers j = 1, . . . , Z — 1 such that fj ^ fj+i 
and fj ~ fj-\-i, respectively. Then 

1<J<1 — 1 

and 

Clearly, these expressions can directly be used to implement the procedures on a com- 
puter. 



3.4. Simulations 

In order to obtain an impression of the performance of the tests we present the results 
of a simulation study. For each pair F and G of distributions chosen for the first and 
the second sample, respectively, an approximation of the power of the tests based on 
and T^^ was obtained empirically by simulations with 1000 replications. For each 
of these samples we used 1000 resamples to get the corresponding approximate critical 
values. The test statistics were calculated with the formulas given in Section 3.3. 
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Table 2. Estimated powers of the tests; significance level a — 0.05 



Distributions m = 50, n = 30 m = 30, n = 50 m = 50, n = 50 



rpKS rpCvM rpKS yCvM yKS j.CvM 



unif (0, 1) 

unif(if) 

Exp(l) 


unif (0, 1) 

unif(i,|) 

Exp(l) 


0.062 


0.065 


0.048 


0.048 


0.049 


0.050 


0.058 


0.059 


0.048 


0.044 


0.063 


0.063 


0.085 


0.099 


0.040 


0.027 


0.056 


0.052 


Wei(2) 


Wei(2) 


0.060 


0.070 


0.054 


0.049 


0.054 


0.057 


r(2) 


r(2) 
r(i) 


0.070 


0.090 


0.042 


0.034 


0.057 


0.060 


r(i) 


0.065 


0.093 


0.037 


0.023 


0.071 


0.064 


unif(il) 


unif (0, 1) 


0.060 


0.012 


0.054 


0.008 


0.052 


0.001 


Wei(2) 


Exp(l) 


0.035 


0.005 


0.011 


0.000 


0.016 


0.001 


unif (0,2) 


Exp(l) 


0.090 


0.044 


0.068 


0.011 


0.078 


0.013 


Exp(4) 


Par(5) 
unif(i,|) 


0.091 


0.088 


0.043 


0.021 


0.055 


0.037 


unif(0, 1) 


0.309 


0.371 


0.187 


0.251 


0.337 


0.378 


Exp(l) 


Wei(2) 
Exp(|) 


0.393 


0.549 


0.208 


0.270 


0.394 


0.517 


Wei(2) 
Exp(|) 


0.284 


0.117 


0.272 


0.061 


0.346 


0.087 


Wei(2) 


0.017 


0.056 


0.013 


0.017 


0.015 


0.036 


r(2) 


Exp(l) 


0.974 


0.887 


0.970 


0.819 


0.991 


0.926 


Exp(l) 


r(|) 


0.812 


0.642 


0.806 


0.493 


0.892 


0.634 



Throughout, the sample sizes are m, n € {30, 50}, and the significance level is a = 0.05. 
The distributions F and G are chosen from the families of exponential distributions 
Exp(A), WeibuU distributions Wei(/3), uniform distributions unif (a, 6) on finite intervals 
(a, 6), gamma distributions r{9) with shape parameter 9 > and scale parameter 1, 
and shifted Pareto distributions Par(77) with parameter rj > and distribution function 
1 — (1 + x)~^ for a; > 0. Taking into account the relationships 

unif (1/4, 3/4) <ic. unif (0, 1) <,t unif (0, 2) Exp(l), 

r(i/2) <3t Exp(i)<,tr(2), 

Wei(2) <icx Exp(l), 
Exp(4/3) <st Exp(l), 
Exp(4) <icx Par(5), 

three groups of pairs of distributions are considered. Table 2 collects the results, where a 
numerical value such as 0.062 in the upper left corner means that 62 of the 1000 samples 
have led to a rejection; the entries therefore approximate the power of the respective test, 
here the Kolmogorov-Smirnov test of the hypothesis that -F <icx G in the case that the 
samples are both uniformly distributed on the unit interval. 

The first group, lines 1-6 of the numerical values in Table 2, represents the boundary 
of the hypothesis, that is, cases where F = G. The two tests behave here essentially in 



Two-sample tests for increasing convex order 



113 



the same way. The given level a = 0.05 is exceeded slightly for sample sizes m > n and 
the tests are nearly conservative for sample sizes m < n. 

In the second group, lines 7-10, pairs from the interior of the hypothesis arc considered, 
where F <icx G and F ^G. For the first, third and fourth pairs of distributions it holds 
that £ A{F, G) and /^^^^ q-^ {t) dt = 0. Thus, the limiting power of the Kolmogorov- 
Smirnov type test is positive and that of the Cramer-von Miscs type test is 0. For the 
second pair of distributions A(F^ G) is empty and the limiting power of both tests is 0. 
Roughly, this behavior is reflected by the empirical power values. Results obtained for 
the special cases F = r(l/2), G = Exp(l) and F = Exp(l), G = r(2) are not shown in the 
tabic; here the empirical power value was found to be for each combination of sample 
sizes. 

The third group in lines 11-16 represents the alternative. None of the two tests turns 
out to be superior to the other one. 

All in all, the simulations indicate that the asymptotic results in Section 2 lead to a 
feasible solution for test problems concerning increasing convex order even for relatively 
small sample sizes. From a practical point of view and especially with respect to imple- 
mentation aspects, numerical stability, etc., our experience shows that the procedures 
arc solid and uncomplicated. Of course, the resampling requires considerable computa- 
tional effort, but with the current standard of hardware and software this is not really 
an issue. For example, we have done the entire simulation within the statistical environ- 
ment R (see http : //cran . r-project . org), which makes it possible to implement the 
procedures in less than 70 lines of code. For a specific data set with ri = m = 50 and 1000 
bootstrap resamples about 9 seconds were needed to calculate the test statistics and the 
critical value on a personal computer with an Intel Core 2 Duo processor running at 2.13 
GHz. Of course, the time required could be shortened considerably by implementing the 
procedures directly in a programming language such as C. 

3.5. Invar iance considerations 

One of the attractive properties of the classical tests for stochastic order, such as the 
Wilcoxon test and the Kolmogorov-Smirnov test mentioned in Section 3.1, is the fact 
that the distribution of the test statistics docs not depend on if is continuous and 
F = G. As a result, exact critical values for finite sample sizes can be obtained. Also, 
stochastic order relates well to the ordering of real numbers: X <st Y implies the existence 
of a pair X\ Y' of random variables with X' =distr X, Y' =distr Y and X' < Y' . 

At the heart of these properties is the fact that the full group of all continuous and bi- 
jective (and hence strictly increasing) transformations ^I* : Rj^ leaves the hypothesis 

F <st G invariant in the sense that 

X<stY ^ *(X)<,t*(y). (3.2) 

Such a strong property is not available in the present situation. Indeed, as the follow- 
ing theorem shows, only rescaling by positive constants leaves the hypothesis F <icx G 
invariant in the above sense. 
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Theorem 5. Suppose that ^':R_|- — s- R_|_ is continuous and bijective, and such that 

X<ic^Y ^ ^iX)<i,^^{Y) (3.3) 
for all bounded random variables X and Y . Then, for some a > 0, 

^(x) = ax for all x > 0. 

Hence the possibilities to reduce the test problem via invariance are limited. Of course, 
as we have already mentioned in Section 1 , the increasing convex order has many proper- 
ties that make it a suitable tool in connection with comparing risks; see also Chapter III 
in Kaas et al. (1994) or Section 1.5 in Miiller and Stoyan (2002). 

Whereas Theorem 5 may be regarded as a negative result as far as data reduction is 
concerned, it does say that the comparison of risks by increasing convex order will not 
be influenced by a change in monetary units, for example. It is straightforward to check 
that the tests in Section 2.4 are scale invariant in the sense that multiplying the data by 
a fixed positive constant does not affect the final decision, so our procedures respect the 
underlying symmetry of the problem. 

3.6. Outlook 

There arc two aspects of our investigations that may be of interest in other situations, too. 
First, whereas consistency against alternatives is a property that is routinely investigated 
in connection with the asymptotic analysis of statistical tests, a similar property for the 
hypothesis is not usually considered. In the context of increasing convex order we have 
seen that this property can be used to distinguish between competing procedures, much 
in the same way as in a situation where there are different sets of alternatives on which 
two tests are consistent. Of course, efficiency considerations are then the logical next 
step; we plan to do this in a separate paper. 

The other aspect, one that we have already pointed out repeatedly, is the fact that 
in order to obtain asymptotically correct critical values by resampling the resampling 
scheme has to be adapted to the hypothesis of interest - the standard procedure of 
generating artificial data by choosing values uniformly at random from the available 
observations may simply be wrong. 

4. Proofs 

Our main tool in connection with distributional asymptotics is the IIoffmann-j0rgensen 
theory of weak convergence, as described and developed in van dcr Vaart and WcUner 
(1996). 
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4.1. Proof of Theorem 1 

For each t > 0, let 0t:]R+ ^ K be defined by (jjtix) := (x - t)+ , and let T := 
{(t>t'-t > 0}. Then is a Vapnik-Ccrvonenkis class of index 2, see Lemma 2.6.16 in 
van der Vaart and Wellner (1996), and hence satisfies the uniform entropy condition. 
The fimction 0o is an envelope for JF. Obviously, as in the transition from (1.3) to (1.5), 

F^'-it) = j c^t{x)F^{dx), Gf{t) = J (t>t{x)Gn{dx) for all t > 0. 

We now consider the empirical processes U,^ ~ {Um[4'))<t>£r and = {U^{(j)))^^yr as- 
sociated with Xi,. . . , X„i and Fi , . . . , 1^ , respectively, 

UfM) - V^l / 0dF,„ - / 0dF ), U^icb) := V^f / <^dG„ - / cbdG 



for all (j) € J-. The processes can be regarded as functions on the underlying proba- 
bility space with values in loci^'), the Banach space of bounded real functions on 
endowed with the supremum norm. We avoid the technical difficulties arising from the 
non-separability of iooi^) by noting that, for every distribution function L with finite 
first moment, tt-^ J 4>tdL is a bounded and continuous function on [0,oo), and that this 
function can be extended continuously to the compactified half- line [0,oo] by oo 0, 
again endowed with the supremum norm. Hence the empirical processes (and their dis- 
tributional limits) all take values in a subspace of ^oo {^) that is a homeomorphic image of 
Co([0, cxd]), the space of continuous functions / : [0, oo] ^ K with the property /(oo) = 0. 
This subspace is closed and separable, and the empirical processes above are measurable 
with respect to the corresponding Borel a-field. Indeed, we will freely switch between the 
representation of the 'time parameter' of the processes as an element (p oi T ov as an 
element t of := [0,cxd]. 

By assumption, the envelope function is square integrable with respect to F and G. 
Hence the function class T is i^-Donsker and G-Donsker, so that 

as m,7i->oo, (4.1) 

distr distr 

where Bp = {Bp{<p))^^jr and Bq = [BG{4')),j}£j^ are centered Gaussian processes with 
covariance functions 

F{uhv)dudv-F'^^{s)F^^{s), 

/OO nOO 
G(uA«)d7.dz;-GSL(s)GSL(s), 

respectively, for all s,i > 0; see Sections 2.1, 2.5 and 2.6 in van der Vaart and Wellner 
(1996). We now define the stochastic processes Vm,n, m,n£N, by 

^m^n ■ — ^m^ni^F^-^ G^j ) 
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- - xl^U'r^ + -rnAF'"^ - G^^). (4.2) 

y 171 + n y 171 + n 

Note that, so far, the hypothesis F <icx G has not been used. By the mdependence of 
the subsamples, and using (2.4), we obtain 

J—^Uf^ - xl-^U^ ^ Vi^Bf - V^Bg =: B, 
y m + n y m + n distr 

where B is again a centered Gaussian process, with covariance function 

P= (1 - t)pf +TPG- 

Now let Bh be as in Section 2.1. A straightforward calculation shows that 

PH{s,t) = pis,t) + r(l - r)(FSL(s) - G^'^mF^'^{t) G^'^it)). 

In particular, when restricted to the (compact, non-empty) set A(F,G), the processes 
Bh and B have the same distribution. 

We want to apply the continuous mapping theorem to obtain the limit distributions 
for 

/>oo 

= sup V^,n (i) , T^:^ = / „ {t) dt 

t>Q Jo 

in the case that F <icx G. Intuitively, it is clear that for the supremum or the positive 
part the values outside A{F,G) are asymptotically irrelevant, as the last term in the 
basic equation (4.2) will tend to — oo for t ^ A{F, G) in view of K,m,n oo. A formally 
correct argument for T^^^-^ uses the Skorokhod almost sure representation together with 
the first part of the following auxiliary result. 

Lemma 1. Let fn, gn (n (zN), g, h be continuous real functions on K = [0,oo] such 
that fn^gn+Cnh, where (c„)ngN is a sequence of non-negative real numbers with 
linin^oo c„ = oo. Assume further that h < 0, that A := {h = 0} =/= and that gn con- 
verges uniformly to g. Then 

lim sup fn{t) = sup g{t). (4.3) 

and, for all r < oo, 

hm / f+it)dt= [ g+it)dt. (4.4) 

Proof. As the restriction of gn to the compact set A converges uniformly to the restric- 
tion of g to A we have 

lim supgn{t) = sup g{t). 



Two-sample tests for increasing convex order 



117 



Clearly, 



liminf sup /„(i) > sup g{t). 



Suppose now that (i,i)„gN C is such that /n(t„) = sup(g^/„(t). We can find a subse- 
quence (nfe)fe£N C N such that -^t^^K and 

lim sup /„j^ (i„ J = Km sup sup /„ ) . 

A uniformly convergent sequence of continuous functions is equicontinuous, hence 

lim g„^{t„J= lim (^o) = 3(^0)- 

k — too k — too 

If to ^ ^ then [to — e,to + e]r\ A~ for some £ > 0, and we would obtain /„j, (t„^. ) ^ — 00 
as supj _g<j<j^^g h{t) < in view of the continuity of h. This shows that Iq G A, and 
hence 

supg(i) > g(to) = limsup5„^ (t„ J 

> lim sup /„ Jt„ J = lim sup sup /„ (t ) 

fe— too n^oo tGK 

> liminf sup fn{t) > sup f/(t), 

teK teA 

which proves (4.3). 

For the proof of (4.4) we note that we have fn{t) = gti^) ^ .9^(0 f^i' teA and that 
fnit) ^ for < ^ A. In view of the uniform convergence of gn to g and < g^ we have 
a constant upper bound for the sequence. As we integrate over a finite interval we can 
now use dominated convergence. □ 

For the proof of the Cramer-von Mises part of the theorem we temporarily abbreviate 
A{F, G) to A. Using (4.4) together with an almost sure representation in the same way 
as we have used (4.3) for the proof of the Kolmogorov-Smirnov part we obtain 



T^n W / ^^.Jt)dt - T^^^^(r) / i?+(t)dt 

for all r < cx). For the tails of the integrals we need another auxiliary result. 
Lemma 2. If EX^^^ < 00 for some e>Q, then J^^ pF{t,t)^^'^ dt < 00. 
Proof. Using (2.3) we obtain 

pp{t,t) =va.r{{X ^t)+)<E{{X ~t)+f 
< a:'F{dx)<—EX^+^ 
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for all t> 0. This shows that PF{t, tY^"^ dt <oo, and the assertion follows in view of 
the fact that pp is bounded (indeed, pF{t,t) < EX^). □ 

The moment assumption in the second part of the theorem and Lemma 2 together 
imply Jp°° p{t, ty^'^ di < oo for p = pp, p = pa and p = pn (it is a straightforward conse- 
quence of (1.2) that finiteness of some moment of order 7 > 1 for G implies the same for 
F if F <icx G). It follows from 

ET^^^= f EB+{t)dt^^ f pH{t,ty'^dt< 00 

that TC^'^ is finite with probabihty one. Clearly, T'^^^{r) T^^^ almost surely as 
r — * 00. 

In order to obtain a uniform bound for the difference between (r) and we 

first note that < V+^„{t) < V;+„(0: where 

Clearly, EVm,n{t) = and var(V'„,„(i)) < pF{t,t) + PG{t,t), and hence EV^^^{t) < 
{pp{t,t) + pG{t,t)y^^ for all t <E A. Suppose now that e > and S > are given. We 
can then choose an r > such that 

poo 

/ {pp{t,t)+pG{t,t)f^^dt<e6, 

J r 

and Markov's inequality gives 

limsupP(|TCy„^(r)-TCyM| >,) < _ / {pp{t,t) + pG{t,t)f' dt < d. 

From this, the second assertion of the theorem now follows on using Theorem 4.2 in 
BiUingsley (1968). 



4.2. Proof of Theorem 2 

The first two terms on the right-hand side of (4.2) are stochastically bounded. If F <icx G 
does not hold, then F'^^{t) > G^^{t) for all t in some non-empty interval (a, b) C M. As 
Km,n 00, we therefore obtain 

/•oo 

SUpKn,„(t) ^ 00, / F+„(t)dt-^00 

t>0 Jo 

with probability one. 
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We mimic the proof of Theorem 1 and give a somewhat condensed argument. With J- as 
in Section 4.1, let U^ ,^ ^ and U^^ - (t/^ ^(0))^^^, 

be the empirical processes associated with the two parts of the resample. We need a state- 
ment analogous to (4.1), where now everything is conditional on the sequences (Xj)jgN 
and (Yfe)fcgN. Due to the size switching we cannot directly work with multiplier central 
limit theorems as, for example, in Chapter 3.6 of van der Vaart and Wellner (1996). In- 
stead we base our proof on a central limit theorem for empirical processes where the base 
distribution varies with the sample size. 

Lemma 3. With probability one, 

Un.tu Bh, Unu ^ Bh as m,n^ qo. 

distr ' distr 

Proof. It is obviously enough to prove the first part. Translated to the present situation 
the conditions (2.8.6) and (2.8.5) in Chapter 2.8.3 of van der Vaart and Wellner (1996) 
become 

limsupS(Z^_il,^^ i>ev/;?J}) = fo^' all £ > 

and 

lim sup \ Y&T[(j)s{ZN,i) - <pt{ZN,i))^^'^ - ^&'^{4>s{Zi) ~ (pt{Zi)f^'^\ = Q, 

where Zi is a random variable with distribution H . Again this is to be interpreted in the 
sense that it should hold with probability one, conditionally on the sequences (Xj)jgN 
and (Yfc)fcgN- 

For the first condition we can simply use the strong law of large numbers, together 
with the assumption that F and G have finite second moments. 
The second condition will follow from 

lim sn^\E4)t{ZN,i)~E(pt{Zi)\=Q (4.5) 

rn.n~*OQ ^>q 

and 

lim snx^\E(^t{ZN,i)(^s{ZN,i)-E(t)t{Zi)(^s{Zi)\=Q. (4.6) 
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Obviously, 



^ m 



N m ■ 
i=i 



Nn 

k=l 



and 



1 . 
N n ^-^ 

k=\ 

From this (4.5) and (4.6) follow as T and Ti :~ {<ps4>t :s,t>0} are easily seen to have 
finite bracketing numbers and thus are Glivenko-Cantelli classes; see Theorem 2.4.1 in 
van der Vaart and Wellner (1996). Now that we have checked the conditions the desired 
statement follows with Theorem 2.8.9 in van der Vaart and Wellner (1996). □ 

In analogy to (4.2) we now define the stochastic processes VN,m,n by 



\l m + n ' V m + n ' 

By the conditional independence of the subsamples, and using (2.4) again, we obtain 
from Lemma 3 that V/v,m,n — ^distr Bh with probability one. Because of 

t>0 

the first part of the theorem now follows on using the continuous mapping theorem. 

In the Cramer-von Mises case we start with a decomposition of the integral, as in 
the proof of Theorem 1. For the compact part we can again use the continuous mapping 
theorem. For the other part of the integral we need a bound for the conditional variances: 
We have 

var(Viv,m,«(i)) <PF„(i,t) +PG„(i,i), 
and, as in the proof of Lemma 2. 



pOO -1 -I poo 

Jo ^ ^ 



(-/2)di 
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and 

^ k=l " k=l •'^ 

SO we can use the strong law of large numbers to obtain an almost sure upper bound 
that does not depend on m and n. 



4.4. Proof of Theorem 4 

We need the following properties of the limit distributions of the test statistics in the 
case that we are on the boundary of the hypothesis. 

Lemma 4. If F ~ G, with F and G as in Theorem 1, then the distribution functions of 
T^^ and T^vM strictly increasing on [0,oo) and continuous on (0,oo). 

Proof. Proposition 2 in Bcran and Millar (1986) states that the distribution of the norm 
supj>Q |_B^f (i)| has a density and a strictly increasing distribution hmction on [0, oo). Thus 
for each a > we have 

p(supBH{t) <a] >p(sup\BH{t)\<a] > 0. 

\t>0 J \t>0 J 

From Theorem 1 and the remark on page 854 in Tsirelson (1975) we deduce that the 
assertion of the proposition is true in the Kolmogorov-Smirnov case. 

To obtain the corresponding assertion in the Cramer-von Mises case we argue as 
follows: Using Lemma 2 we obtain 

Ej^ \BH{t)\dt = E\BHit)\dt=^-J^ PH{t,tY'^dt<^, 

hence the process Bh has integrable sample paths with probability one. Thus we can 
regard Bh as a random element with values in a certain closed separable subspace of 
the Banacli space L^([0, oo), dt). Again by Proposition 2 in Beran and Millar (1986) the 
distribution of the norm \BHit) \ dt has a density and a strictly increasing distribution 
function on [0,oo). By the Hahn-Banach theorem there exists a sequence of functions 
(en)raGN C L°°{[Q, oo), dt) such that 

pOO POO 

/ \BH{t)\dt = snp BH{t)en{t)dt. 

Jo n>lJo 

Noting that B^{t) ^ ^{\BH{t)\ + BH{t)) we can then write 

/ B+(t)dt=isup / BH{t){en{t) + l)dt. 

Jo n>lJo 
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The random variables ^ BH{t){en{t) + l)dt, n S N, have centered normal distribu- 
tions. Since 



/ /"OO \ / /"OO 

P supi / BHit){enit) + l)dt<a) =P( B+{t)dt<, 

\n>l Jo J \J0 

>P(^^ \BHit)\dt<a^ >0 

for each a > wc can again apply the results in Tsirelson (1975) to see that the second 
assertion of the lemma is also true. □ 



Theorem 3 and the fact that the distribution function of the limit is strictly increasing 
together imply the convergence of the quantilcs, that is, 

S™'„(«) - c^'iH,a), cZ:^ia) -> c^^^iH,a) (4.7) 

almost surely. From this, the last statement, the consistency on the full alternative, 
follows on using Theorem 2. 

Suppose now that F <icx G; let us temporarily write Pks and PcvM for the true limiting 
distribution functions of the test statistics and Pks and PqvM for the limiting distribution 
functions for the bootstrap estimators for the distributions of the test statistics. In both 
cases the limits can be obtained via the supremum or integral of the process Bh, over 
the whole of R+ for Pks and PcvM and over the subset A{F, G) of K+ for Pks and PcvM- 
Since F and G are assumed to be non-degenerate, we have that the supremum and the 
integral of the process Bh over are bounded from below by BniO) and Bnit) dt, 
which shows that Pks and PcvM arc bounded from below in stochastic order by non- 
degenerate centered normal distributions. This implies that the quantilcs c^^{H,a) and 
c'~^^^{H,a) are positive due to our general assumption that a < 1/2. On the boundary 
of the hypothesis we have H = F = G, and Pks — PcvM and PcvM — PcvM- Hence, using 
the continuity of Pks and PcvM on (0,oo) 

hm P{T^%>c'^^^{a))=a 

m,n— >cx3 ' 

and 

hm PiT^:^>cg^^ia))=a, 

m,n — *oo 

which is the respective first statement in the theorem for the two tests. 

For the proof of the remaining assertions we first note that var(i?^f (t)) = for t > j{H), 
so wc may restrict the supremum or integral to S = A{F, G) D [0, j{H)). The distribution 
functions Pks and PcvM have a jump of size 1 at if S' = or £{S) = respectively, so 
that in this case 

hm P{T:^%>cl%{a))^0 
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and 

lim P(tCvM>5CvM(^))^o 

rn.n — >oo 



On the other hand, ii S ^ and £{S) > we argue as in the proof of Lemma 4 to 
obtain that the distribution functions i^KS a-nd -FcvM sltc strictly increasing on [0,oo) 
and continuous on the interval (0,oo). From this and Fks <st Fks and FcvM <st FcvM 
it follows with Theorem 1 that the limits 

^KS (a; F, G) = hm PiT^% > c^s^ (a) ) = 1 - Fks (c^' {H, a))<a 

m,n— *oo 

and 

V'CvM(a; F, G) = lim P{T^''^ > c^-^{a)) = 1 - FcvM(c^""(i/, a)) < a 

m,n — »cxD 

exist and are positive. 

4.5. Proof of Proposition 1 

The function 

fit) :=FSL(t)_G^L(t) 

j=i fc=i 

^ m 1 ^ 

= — max(Xw , i) maxfYfc , t) 

3 = 1 k=i 

is linear on the intervals [Zj, Zj+i), that is, of the form 

f{t) = aj + Pjt, t e [Zj,Zj+i), j = 1, 1, 
with certain real aj and (3j. Putting fj = f{Zj), j = 1, . . . ,1, we have 

Fm,n ~ l^m.,n ^lax / . 

l<j<L — 1 

Further, with J and K as in the statement of the proposition, we obtain the representa- 
tion 

h2 r + 2 \ 
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4.6. Proof of Theorem 5 

Let y be a bounded random variable and put X = EY; then X <icx Y by Jensen's 
inequality. Hence if "if satisfies (3.3) we must have '^/{EY) < E'i/{Y) for all bounded 
random variables Y. From this property it easily follows, by taking Y to be concentrated 
on two values, that is convex. Now let <!> be the inverse of 5*. Obviously, if (3.3) holds 
for 5*, then it also holds for <f>, so that <!> has to be convex, too. Now suppose that, for 
some Xi,X2 G M+, a G (0, 1), 

^'(axi + (1 - a)x2) < a'^ixi) + (1 - a)'if{x2). 
Then, with yi := 'l'(xi), i = 1,2, and using the fact that $ is strictly increasing, 

a$(yi) + (1 - a)$(y2) < ^{ayi + (1 - a)y2), 
which is a contradiction. Hence, 5* must satisfy 

5'(aa;i + (1 - a)x2) = a-^{xi) + (1 - a)*(.T2) 

for all xi,X2 E M+ and all a E (0, 1). From the general assumptions on it follows that 
^•(0) = 0. Taken together these properties imply that ^'(a;) = .t^'(I) for all x > 0. 
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